Scalable XSLT Evaluation
نویسندگان
چکیده
XSLT is an increasingly popular language for processing XML data. It is widely supported by application platform software. However, little optimization effort has been made inside the current XSLT processing engines. Evaluating a very simple XSLT program on a large XML document with a simple schema may result in extensive usage of memory. In this paper, we present a novel notion of Streaming Processing Model (SPM ) to evaluate a subset of XSLT programs on XML documents, especially large ones. With SPM, an XSLT processor can transform an XML source document to other formats without extra memory buffers required. Therefore, our approach can not only tackle large source documents, but also produce large results. We demonstrate with a performance study the advantages of the SPM approach. Experimental results clearly confirm that SPM improves XSLT evaluation typically 2 to 10 times better than the existing approaches. Moreover, the SPM approach also features high scalability.
منابع مشابه
. D B / 0 40 80 51 v 1 2 2 A ug 2 00 4 Scalable XSLT Evaluation ⋆
XSLT is an increasingly popular language for processing XML data. It is widely supported by application platform software. However, little optimization effort has been made inside the current XSLT processing engines. Evaluating a very simple XSLT program on a large XML document with a simple schema may result in extensive usage of memory. In this paper, we present a novel notion of Streaming Pr...
متن کاملA Scalable XSLT Processing Framework based on MapReduce
The eXtensible Stylesheet Language Transformation (XSLT) is a de-facto standard for XML data transforming and extracting. Efficient processing of large amounts of XML data brings challenges to conventional XSLT processors, which are designed to run in a single machine context. To solve these data-intensive problems, MapReduce paradigm in the cloud computing domain has received a comprehensive a...
متن کاملTowards Scalable Ontology Engineering Patterns: Lessons Learned from an Experiment based on W3CTMs Part-whole Guidelines
This paper presents an empirical evaluation of description logic reasoners to support the selection of scalable ontology engineering patterns for TBox reasoning. Our main objective is to define the rationale behind the design decisions required for the generation of large ontologies with XSLT-based tools. We discuss here the outcomes of an experiment focusing on aircraft components and parts fo...
متن کاملUsing XML Languages for Modeling and Web-Visualization of Geographical Legacy Data
In our aim to modernize geographical legacy data from the German office of geographical survey with XML languages, we first modeled the data in GML, a standard language for geography markup. The resulting model has then been used as a template for a JAVA application that assembles the necessary information from the legacy data and writes it into a GML document. During the second part of our wor...
متن کامل